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SPECIFICATION 

TO ALL WHOM IT MAY CONCERN: 

Be it known that we, Mansour Tahemezhaadi, Naperville, Illinois, and 
Michael J. Kirk, Mount Prospect, Illinois, have invented new and useful APPARATUS 
AND METHOD FOR IMPLEMENTING A PACKET BASED TELECONFERENCE 
BRIDGE, of which the following is a specification. 



APPARATUS AND METHOD FOR IMPLEMENTING A PACKET BASED 
TELECONFERENCE BRIDGE 



5 BACKGROUND 

The present invention relates to an apparatus in method for a implementing a packet 
based teleconferencing bridge and, more particularly, to a teleconferencing bridge that avoids 
vocoding for packet-based communications. 
10 In teleconferencing applications, typically a teleconferencing bridge is used to 

combine all received audio sources into a single audio source prior to transmission to 
destination points (i.e., the end user participating in a teleconference call). In the past, most 
i;3 teleconferencing bridges have received audio sources from public switched telephone 
2 network (PSTN) utilizing compressed pulse code modulated (PCM) format. With the rapid 
i 5 deployment, however, of mobile and Intemet telephony, increasingly larger number of 
j!' " conference call participants present their audio source to the teleconferencing bridge in the 
f i] form of a speech packet rather than in a compressed PCM format. Conventional bridges, 
1 1.1 however, simply decode all packets received from mobile or voice over IP (VoIP) users and 
! y sum them to produce a single audio source. Moreover, if there are any PSTN landline callers 
20 coimected to the teleconferencing bridge, their compressed PCM speech samples are 

expanded and subsequently added to the single audio source so as to produce a final single 
audio source, which is, in turn, subsequently encoded for transmission to a packet based 
destination or, alternatively, may be PCM compressed for a landline destination point. The 
packet based destination points (e.g., mobile or Intemet users), however, experience 
25 compromised speech quality due to the decode-encode-decode process, also known as 

tandem vocoding. Additionally, this tandem vocoding increases the round trip delay for the 
transmission of speech packets. 
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Attempts in the prior art to mitigate this compromised speech quality due to tandem 
vocoding have included utilizing a state machine that gives priority to active speech packets 
based on the speech rate of the packet from a multitude of received speech packets. An 
underlying assumption of this methodology is that one conferencing participant will be active 
5 in the span of several speech frames. If all or a number of the parties are talking 

simultaneously, however, the methodology will introduce packet loss. Additionally, voice 
quality in teims of maintaining the continuity of respective background noises when 
switching from one priority party to another may be annoying and lead to listener fatigue. 

Another technique known in the prior art attempts to alleviate the associated problems 

1 40 of the above-described prior art methodology. This technique consists of allowing multiple 

□ 

decoders to run at the destination point and tying each decoder into a separate mobile 

SI 

: supplemental channel (i.e., one decoder per supplemental channel). If a new participant 

'H 

S p wants to join a conference call, however, the infrastructure must free up an additional 

n supplemental chaimel to add to the existing pool of supplemental channels, thereby utilizing 

i 

i .-.iS greater system resources. Furthermore, it is required that all vocoders be of the same type. 

ia 

(3 Hence, this technique in the prior art is very taxing on resources of the infrastructure and has 
less flexibility as all decoders must be of the same type. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 

FIG. 1 illustrates a communication system for multi-party speech connection utilizing 
a teleconferencing bridge constructed in accordance with the teachings of the present 
invention. 

25 FIG. 2 illustrates a decoding device for decoding signals sent from the 

teleconferencing bridge shovm in FIG. 1. 
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FIG. 3 illustrates the method of scheduling audio packets by the decoding device 
shown in FIG. 2. 

FIG. 4 illustrates an overall system diagram showing connections between the 
teleconferencing bridge and various end destinations in accordance with the teachings of the 
present disclosure. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The present disclosure provides a multi-party communication system utilizing a 
teleconferencing bridge that obviates the need for tandem vocoding for packet-based 
conference call participants. This is accomphshed by providing a teleconferencing bridge 
that is configured to accept a nimiber of different types of communication signals from 
multiple parties involved in a teleconference call. The teleconferencing bridge, in particular, 
includes one or more tagging devices that are each configured to respectively tag incoming 
conmiunication signals from the multiple parties with predetermined information that is 
particular to the received packet signal. The teleconferencing bridge also includes a 
controller that is configured to combine the tagged signals into one or more data packet 
signals and then subsequently transmit the one or more data packet signals to each of the 
multiple parties involved in the conference call. Additionally, a decoding device for 
decoding the signals sent by the teleconferencing bridge is included for each participant in the 
multi-party teleconferencing call. The decoding devices are configured to utilize the tag 
information in order to direct the packets to an appropriate type of decoding routine for the 
particular packet signal. The use of the tag information obviates the need for decoding and 
subsequent encoding by the teleconferencing bridge, thereby requiring less infrastructure 
resources and, accordingly, maintaining a high level of speech quality. 



FIG. 1 illustrates a communication system 10 wherein a multi-party communication 
may be effected. As shown, a teleconference bridge 18 for multi-party communication (i.e., 
teleconference calls) is configured to accept different types of communication signals 
received from multiple participants over an intermediate transport network 16. The 
participants may include mobile/Voice over IP phone users 12 that transmit packet based 
communication signals to the teleconference bridge 18. Additionally, the teleconference 
bridge 18 may accept signals in pulse code modulated (PCM) format from plain old 
telephone service (POTS) phones 14 over dedicated channels as indicated. 

Within the teleconference bridge 18 are included tagging devices 20 that receive 
packet-based information from the Mobile or Voice over IP users 12. The tagging devices 20 
are configured to tag an incoming signal from a call participant with a tag that may include 
predetermined information particular to that signal. For example, the tag may include 
information concerning the vocoder type needed to decode the packet-based infonnation 
when it is received by the end user having a decoder or a decoding device, which will be 
explained later. Further infonnation in the tag includes a user ID that is used to correlate the 
particular packet with the end user that has originated that particular packet. Other 
infonnation included in the tag may include also the particular network server, particularly in 
the case of Mobile and Voice over IP users, that serves the originating user. 

Once packets have been tagged by the tagging devices 20, they are passed to a 
controller 28 that combines the packets to be sent out back over the intermediate transport 
network 16, as will be described later. For audio signals originating from landUne phone 
users 14, the teleconference bridge 18 includes expanders 22 that are used to expand the 
compressed PCM signal fomaat in order to decode the audio information. The signals 
expanded and decoded by the expanders 22 are delivered to excluding logic 23. Functionally, 



the excluding logic 23 serves to exclude speech information from a particular POTS phone 
user to which a packet is being sent by the teleconference bridge 1 8, thereby precluding the 
POTS phone user 14 from hearing their own speech information. Hence, the excluding logic 
23 serves to exclude and delay or buffer audio information until it is needed for inclusion in 
5 an audio packet to be sent over the intermediate transport network 16. 

As signals are released by the excluding logic 23 they are, in turn, dehvered to a 
summing device 24 that combines the decoded PCM signals into a single audio signal. This 
single audio signal is delivered to an encoder and tagging device 26. The function of the 
encoder and tagging device 26 is to first encode the single audio packet received from the 

hJO summing device 24 into a packet-based audio format. Additionally, the encoder and tagging 

Q 

'■J device 26 also tags the packet with the same type of information as the tagging devices 20 tag 

1 1 packets originating fh)m mobile or IP users, for example. 

i p 

l|j Alternatively, instead of summing the decoded signals from the decoded signals from 

- the expanders 22, each expander 22 may have an respective encoding and tagging device (not 
U 5 shown), similar to encoder and tagging device 26, that encodes each individual signal to a 
Q packet based format and appropriately tags the packet with information concerning the 
particular end user 14 originating the PCM format audio signal. This alternative would 
obviate the need for the summarizing device 24, as tagged signals could be delivered directly 
to the controller 28. 

20 The controller 28 within the teleconference bridge 1 8 serves to assemble the tagged 

packets into a packet for transmission over the intermediate fransport network 16, as well as 
having the function of bandwidth allocation for the packets being sent. Another term for this 
bandwidth allocation is "Bandwidth Brokering" wherein the controller 28 may allocate more 
or less bandwidth to be used dependent on the current state of the intermediate transport 
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network 16. For example, if a high degree of traffic exists on the intermediate transport 
network 16 causing slower transmission delays, the controller 28 may be configured to 
adaptively arrange packets that it receives into a "superpacket" by multiplexing the packet 
signals received into a single large packet of audio information. In cases where the 
5 intermediate transport network 1 6 is not experiencing high traffic volume the controller 28 
may then adaptively send packets received out serially over the intermediate transport 
network 16 without multiplexing these packets into a single packet. The ability of the 
controller 28 to broker the bandwidth used improves the scalability of the infrastructure since 
one single supplemental or dedicated channel in multi-task mode is sent from the 
i; J 0 teleconference bridge 1 8 to the caller participants with a single packet stream. 
!; J An additional function of the controller 28 is to address information sent out to a 

jiy particular end user. Additionally, the audio information that originated from a particular user 

in 

I i=i to which a packet is being addressed is deleted or omitted from the transmitted package by 

i:3 controller 28 such that the end user will not receive and, thus, hear their own transmitted 

111 

r 4 5 audio information on their speaker. Accordingly, separate audio communication packets are 
sent to each respective user cormection to the conference call and respectively omitting their 
own originated audio information. 

Yet another function of the controller 28 is to track the number of participants 
currently connected to a multi-party call. This is accomplished by creating a call control 

20 table that tracks the number of participants to determine when existing participants drop out 
of the current multi-party call and when new participants join the call. The controller 28 then 
relays the information contained within the control table to each of the users over the 
supplemental channel transmitted over via the intermediate transport network 16. The 
control call table may utihze any known protocol that is used to facilitate messaging between 
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a decoding device 30 (to be discussed later) and the infrastructure bridge 1 8 to inform the 
decoding device 30 of various types of information such as vocoder type, authentication, or a 
call tear down (i.e., when a mobile leaves the conference, thereby freeing up resources or 
addition of a participant). 

Still another function of the controller 28 is to decode the packet based signals having 
a POTS phone or PSTN destination. The controller 28 then encodes the signal into a 
compressed PCM format that is transmitted by a dedicated channel to a POTS Phone 14 or 
PSTN as indicated. 

In FIG. 2, an example of a configuration used by an end user in accordance with the 
teachings of the present invention is illustrated. It is noted that this configuration is the same 
for each user connected to a multi-party conference call. As shown, each end user has an 
associated decoding device 30. The decoding device 30 receives the packet information 
transmitted by the teleconferencing infrastructure 1 8 over the intermediate transport network 
16. A De-tagging and Scheduling device 32 is configured to receive the packet information. 
The De-tagging and Scheduling device 32 first reads the information contained in the tags for 
each of the tag packets that have been sent by the infrastructure 1 8. Additionally, the De- 
tagging and Scheduling device 32 receives information provided by the confroUer 28 that 
informs the decoding device 30 of decoder types that will need to be run. Thus, if the 
decoding device 30 does not support a particular decoder type, communication may be sent 
by the decoding device back to the controller 28 over the intermediate transport network 16 
whereupon the controller 16 initiates a fiirther fimction of decoding packet types not 
recognized by the decoding device 30 for that particular end user having that decoding device 
30. 



Another function of the De-tagging and Scheduling device 32 is to determine a 
queuing order of the received packets received in a predetermined samphng period in which 
the packets are sent sequentially to various decoders 34 within the decoding device 30. 
Concomitant with this function is the need to address which of the decoders a particular 
5 packet is to be sent based on the vocoder type for that particular signal. For example, a signal 
that originates from a mobile telephone user in a packet-based format will require an 
appropriate decoder that is configured to decode the packet-based information of that 
particular mobile telephone type. Each of the decoders 34 as shown in FIG. 2 decode a 
corresponding type of signal according to some predetermined decoding algorithm. 

;,I10 An additional feature of the decoders 34 is that they are configured to decode multiple 

signals of the same type but having different parameters. That is, a decoder having an 

, algorithm to decode a particular vocoding type from a mobile user, for example, may be 

utilized to decode audio information received from a nimiber of additional users coimected to 
the conference call having mobile telephones issuing the same vocoder type of signals. In 
1 5 order to achieve this function, the decoding algorithms are configured to include multiple 
predetermined thread logic routines wherein each thread logic routine is assigned to a 
particular user originated signal and the signal's corresponding parameters whereas another 
thread logic routine would be used for another originating user that executes the same 
algorithm, but with that particular user's parameters. 
20 Finally, the decoded signals output from the decoders 34 within the decoding device 

30 are input into a simiming device 36 to combine the decoded signals. Accordingly, a signal 
is output to a speaker or listening device (not shown) of the end user wherein the end user is 
allowed to hear the audio information from all other participants in the multi-party 
conference call. 
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As mentioned previously, the De-tagging and Scheduling device 32 establishes a 
queue for decoding the incoming signals. In order to accomplish this scheduling, a task 
scheduling algorithm is utilized within the De-tagging and Scheduhng device 32 wherein the 
signals to be decoded are ordered according to the speech rates of the received packets. 
5 Preferably, those packets having higher speech rates are given a higher priority in the queue 
whereas packets with lower speech rates are given lower priority during a given sampling 
period. 

An example of this scheduling is illustrated in FIG. 3. As shown, a received packet 
40 that is either transmitted serially, as shown, or was multiplexed into a "superpacket" 
go format is input to the task scheduling algorithm in the De-tagging and Scheduling device 32. 
;;1 The illustrated packet 40 in the example of FIG. 3 contains four audio information packets 

i;ri 

j Ij 42-45 originating from four different users from the user receiving this. Each packet contains 

m 

[ft a corresponding tag that was attached at the infrastructure 18. 

5 3 As shown in FIG. 3 the speech rate of each packet is also shovm. Determination of 

i j 5 the packet speech rate is accomplished by the De-tagging and Scheduling device 32 when the 
j;:! packet is received. Particularly for the wireless CDMA domain, variable rate vocoders are 
utilized having determined discrete values of frill rate (e.g., 8 kilobits per second), half rate 
(e.g., 4 kilobits per second), 1/4 rate (e.g., 2 kilobits per second) and 1/8 rate (e.g., 1 kilobit 
per second). For periods when the originating user is inputting a high level of audio energy 
20 into the system, full rate encoding/decoding is used whereas during periods of silence or 

pauses in speech a lower rate such as 1/8 rate, for example, would be selected by the vocoder. 
A shown in the example of FIG. 3, the packets 42 and 44, which are received first and third in 
order, were encoded at full rate. The second received packet 43 was encoded at 1/8 rate and 
the fourth received packet 45 was encoded at 1/4 rate. 
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According to an exemplary algorithm utilized in the De-tagging and Scheduling 
device 32, packets 42-45 are queued according to their data encoding rates. Accordingly, 
packets 42 and 44 , both would receive higher priority than the other lower rate packets 
received. Among packets 42 and 44 having the same data rate, the sequential ordering is 
based on the order received. Accordingly, as shown in the buffer queue 46, packet 42 is 
placed first in the queue 46 and packet 44 placed second. Next, those packets having the next 
highest encoding data rate, namely packet 45, is placed third in the queuing order with packet 
43 being placed last as it has the lowest encoded data rate. The reason for this ordering of 
data rates is to ensure that those packets having the most audio information in a given 
sampling period are encoded first before those packets having less audio information. This 
ensures that the quality of speech is high since the more crucial audio packets having greater 
speech information (as compared to pauses or silence) will have a greater likelihood of being 
encoded and not dropped. 

It is noted that the decoding device 30 is associated with each particular user 
connected to the multi-party conference call. For mobile users and Voice over IP users, the 
decoding device 30 may be configured as hardware or as software that is implemented within 
the respective mobile telephone or Internet phone. For traditional landline based participants, 
the decoding device 30 would be located at the PSTN to which the landline user is connected. 

Since the task scheduling algorithm can be adopted to run the multiple decoders 32 in 
a standard fashion based on the packets' associated speech rates this will ensure higher 
speech quality as well as lower delays. 

An example of an overall configuration of a network utilizing the disclosed 
teleconference bridge 18 is shown. As illusti-ated, a mobile server 50 that serves a number of 
mobile users 52 is connected to the teleconference bridge 18 via an intermediate transport 
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network 54 that is the equivalent of the previously illustrated network 16. Each of the mobile 
users 52 includes a decoding device 30 such as that as illustrated in Fig. 2. With respect to 
landline users 58, a PSTN 56 is connected to the teleconference bridge 18 via a frame pipe 60 
that both transmits and receives PCM type audio information. Accordingly, for transmissions 
from the teleconference bridge 18 to the PSTN 56, the teleconference bridge 18 serves as a 
decoder, preferably accomplished by the controller 28, in order to decode the assembled 
packets and subsequently encode the information to a compressed PCM signal format 
recognizable by the FSTN 56. 

Alternatively, signals having landline user 58 destinations may be transmitted as 
packet based information over the intermediate transport network 54 to a media gateway 62 
as illustrated in Fig. 4. The media gateway 62 is comprised of one or more decoders that 
decode the packet based information and, in turn, encode the signals in a compressed PCM 
forniat that is transmitted to the PSTN 56. The media gateway 62 may be located external to 
the PSTN 56 or maybe contained within the PSTN 56. 

In conclusion, the apparatus and method in accordance with the teachings of the 
present invention as disclosed avoids tandem vocoding for mobile and voice over IP users 
that traditionally was performed by conventional teleconferencing bridges. Accordingly, the 
system resources are not taxed thereby saving cost as well as iacreasing the quality of service 
particularly for mobile and other packet based users of multi-party conferencing systems. 

While the teachings of the invention have been explained with the respect to 
particular examples, it will be apparent to those of ordinary skill in the art that the scope of 
the disclosure is not limited to those examples. On the contrary, this application covers all 
apparatuses and methods falling within the spirit and scope of the appended claims, either 
literally or under the doctrine of equivalents. 
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